PodCastle: Collaborative Training of Language Models on the Basis of Wisdom of Crowds

نویسندگان

  • Jun Ogata
  • Masataka Goto
چکیده

This paper presents a language-model training method for improving automatic transcription of online spoken contents. Unlike previously studied LVCSR tasks such as broadcast news and lectures, large-sized task-specific corpora for training language models cannot be prepared and used in recognition because of the diversity of topics, vocabularies, and speaking styles. To overcome difficulties in preparing such task-specific language models in advance, we propose collaborative training of language models on the basis of wisdom of crowds. On our public web service for LVCSR-based spoken document retrieval PodCastle, over half a million recognition errors were corrected by anonymous users. By leveraging such corrected transcriptions, component language models for various topics can be built and dynamically mixed to generate an appropriate language model for each podcast episode in an unsupervised manner. Experimental results with Japanese podcasts showed that the mixed languages models significantly reduced the word error rate.

منابع مشابه

Podcastle: collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription

This paper presents acoustic-model-training techniques for improving automatic transcription of podcasts. A typical approach for acoustic modeling is to create a task-specific corpus including hundreds (or even thousands) of hours of speech data and their accurate transcriptions. This approach, however, is impractical in podcast-transcription task because manual generation of the transcriptions...

متن کامل

The Effect of Summary Training on Intermediate EFL Learners’ Reading Comprehension in Individual and Collaborative Conditions

Inspired by Vygotsky’s Sociocultural Theory (SCT), the current study intended to investigate the effect of summary training (i.e., oral and written) on intermediate EFL learners’ reading comprehension in different conditions (i.e., individual and collaborative). Data collection tools and procedures encompassed PET test, First Certificate in English (FCE) reading pre-test, and post-test. First, ...

متن کامل

Interactional complexity development, interactional demonstrators and interaction density in collaborative and e-collaborative writing modalities

This study aimed at investigating the potential of collaborative and e-collaborative writing modalities in developing interactional complexity, utilization of interactional demonstrators and density of interaction. To this end, 66 Iranian intermediate female English as foreign language learners (EFL) were selected to participate in this study according to their scores on Oxford Placement Test (...

متن کامل

Podcastle: Improvements of Speech Recognition by Using Acoustic Modeling Based on Wisdom of Crowds

1 はじめに 我々は,ポッドキャストを音声認識によって 自動的にテキスト化することで,それらをユー ザが全文検索できるだけではなく,詳細な閲覧, 編集も可能なソーシャルアノテーションシステ ム「PodCastle1)2)3)」の開発,運営を行っている. ポッドキャストは実環境の多様な音声データであ り,従来の音声認識技術では高い認識率を達成す ることは難しい.そこで PodCastleでは,多数 のユーザに認識誤りを訂正 (アノテーション)す る協力をしてもらうことで,音声認識率をシス テムの運用中に向上させる枠組みを採用してい る.こうすることで,検索サービスとしての質を 向上させるだけでなく,音声認識技術の底上げを はかることも狙っている. 本研究では,上記の枠組みの一環として,PodCastleを通じて得られる集合知,すなわちユー ザによる音声認識誤りの訂正結果を活用した音 響...

متن کامل

DISTRIBUTED AND COLLABORATIVE FUZZY MODELING

In this study, we introduce and study a concept of distributed fuzzymodeling. Fuzzy modeling encountered so far is predominantly of a centralizednature by being focused on the use of a single data set. In contrast to this style ofmodeling, the proposed paradigm of distributed and collaborative modeling isconcerned with distributed models which are constructed in a highly collaborativefashion. I...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012